Recommending language models for search queries based on user profile

ABSTRACT

Systems and methods for a media guidance application that generates results in multiple languages for search queries. In particular, the media guidance application ranks search results according the language model associated with the search result.

BACKGROUND

Voice recognition as a technique for interacting with user devices hasgrown popular. These systems typically depend on recognizing words andphrases spoken by a user. To recognize these words, the devices processreceived voice commands according to the language of the user. However,many users may speak in multiple languages and/or speak a language witha particular accent that may not be recognized by the user device.Moreover, the device may not be able to provide results that arespecific to a particular language or accent.

SUMMARY

Accordingly, systems and methods are described herein for a mediaguidance application that generates results in multiple languages forsearch queries. Thus, users no longer need to worry that a device maynot recognize a particular language or accent, or that the device maynot be able to provide results that are specific to a particularlanguage or to a particular accent. In particular, the media guidanceapplication resolves multiple language barriers by taking automatic andmanual user language settings and applying those settings to a varietyof potential search results.

For example, the media guidance application (e.g., implemented on a userdevice) may automatically apply multiple language models to subsequentsearches in response to determining that a current search producedresults corresponding to those multiple languages. For example, themedia guidance application may receive a first voice query from a user,wherein the user corresponds to a user profile. The media guidanceapplication may apply a first language model, based on the user profile,to the first voice query to identify a first plurality of search resultsbased on the first voice query, wherein the first language modelcorresponds to a first language (e.g., English). The media guidanceapplication may automatically determine that the first search resultcorresponds to a first language (e.g., English) and that a second searchresult corresponds to a second language (e.g., Hindi). In response todetermining that the second search result corresponds to the secondlanguage, the media guidance application may update the user profile toapply a second language model to searches, wherein the second languagemodel corresponds to the second language. The media guidance applicationmay then receive a second voice query from the user, apply both thefirst language model and the second language model, based on the userprofile, to the second voice query to identify a second plurality ofsearch results based on the second voice query, and generate fordisplay, on the display device, the second plurality of search results.Thus, the media guidance application automatically applies multiplelanguage models to subsequent searches in response to determining that acurrent search produced results corresponding to those multiplelanguages.

In another example, the media guidance application may rank searchresults based on how well different interpretations of a voice querymatch different language models, not simply how popular a given searchresult is in each language. The media guidance application (e.g.,implemented on a user device) may receive an audible input voice queryfrom the user, whose user profile is stored in memory. The mediaguidance application may receive a user input indicating a first andsecond language model, which dictate the search results. In response toreceiving the user input, the media guidance application may parse thatinput into a first and second audio segment, based on audible breaksfrom the user. The media guidance application may then determine a firsttext query based on the first language and subsequently rank that textquery. The media guidance application may determine a second text querybased off the second language and rank that text query as well. Themedia guidance application may then generate for display a first searchresult for the first text query and a second search result for thesecond text query, wherein the first search result and the second searchresult are ordered according to the first composite ranking and thesecond composite ranking, respectively. Thus, the media guidanceapplication provides search results ranked based on how well differenttextual interpretations of a voice query match different languagemodels, not simply how popular a given search result is in eachlanguage.

In some aspects, the media guidance application may receive, via anaudio input device, a first voice query from a user, wherein the usercorresponds to a user profile that is stored in memory. For example, themedia guidance application may store a user profile that has informationon the user's tendencies and preferences, in particular the languagesused and/or preferred by the user.

The media guidance application may apply a first language model, basedon the user profile, to the first voice query to identify a firstplurality of search results based on the first voice query, wherein thefirst language model corresponds to a first language. For example, themedia guidance application may determine that the first language settingcorresponds to English (e.g., either by default or based on a prior userselection). Thus, the media guidance application may use an Englishlanguage model for an initial search. To apply a language model, themedia guidance application, in some embodiments, may parse the firstvoice query to identify a first audio segment and a second audiosegment, based on audible breaks in the first voice query, comparing thefirst audio segment to the first set of phonemes to determine a firstresolved word corresponding to the first audio segment, and determininga first text query based on the first resolved word. For example, if thefirst voice query was “Vincent Chase”, the media guidance applicationmay identify a first audio segment as “Vincent” and the second audiosegment “Chase”. The media guidance application may, additionally oralternatively, indicate a first set of phonemes for a first languagemodel used to identify the first plurality of search results. Forexample, any given word in a certain language is a combination ofphonemes. The media guidance application may use the variouscombinations of phonemes of a certain language to resolve the audiosegments into text queries.

The media guidance application may generate for display, on a displaydevice, the first plurality of search results. For example, the userwill be able to view a variety of results based on his or her voiceinput. The results could be several items, such as movies related to agiven actor.

The media guidance application may retrieve, from the memory, firstmetadata for the first search result and second metadata for the secondsearch result. The media guidance application may then automaticallydetermine that the first metadata for the first search result of thefirst plurality of search results indicates that the first search resultcorresponds to a first language, and that the second metadata for thesecond search result of the first plurality of search results indicatesthat the second search result corresponds to a second language. Forexample, the media guidance application may process metadatacorresponding to each search result to determine a language or regiontypically associated with the search result. For example, in response tometadata detected indicating that a movie was filmed in India, producedby the Indian film industry (e.g., Bollywood), or featuring Indianactors, the media guidance application may process metadata to determinea language corresponding to that metadata other than English.

The media guidance application may, in response to determining that thesecond metadata for the second search result of the first plurality ofsearch results indicates that the second search result corresponds tothe second language, update the user profile to apply a second languagemodel to searches, wherein the second language model corresponds to thesecond language. For example, in response to determining that the secondsearch result corresponds to Hindi, the media guidance application mayupdate the user profile to perform searches in both English and Hindi.In some embodiments, the media guidance application may update the userprofile where updating the user profile includes: retrieving a languagesetting from the user profile, comparing a language corresponding to thelanguage setting to the second language, and

determining that the second language does not correspond to the languagecorresponding to the language setting. Alternatively, if the mediaguidance application determines that the first metadata for the firstsearch result of the first plurality of search results indicates thatthe first search result corresponds to the first language, the mediaguidance application maintains the language setting (e.g., listing onlyEnglish) in the user profile.

The media guidance application may receive a second voice query from theuser. For example, now that the user profile has been updated with afirst and second language model, the application can analyze anadditional voice query under the new language preferences (e.g., Englishand Hindi). The media guidance application may apply both the firstlanguage model and the second language model, based on the updated userprofile, to the second voice query to identifying a second plurality ofsearch results based on the second voice query.

The media guidance application may generate for display, on the displaydevice, the second plurality of search results. For example, the secondplurality of search results may include search results identified afterprocessing the voice query in both English and Hindi.

In some embodiments, the media guidance application may update the userprofile after receiving a user confirmation to apply the second languagemodel to searches, wherein a prompt for receiving the user confirmationis generated for display in response to determining that the secondmetadata for the second search result of the first plurality of searchresults indicates that the second search result corresponds to thesecond language. For example, in response to determining that additionallanguages may be used (or may be desired by a user), the media guidanceapplication may prompt the user for confirmation before updating thelanguage settings in the user profile.

In some embodiments, the media guidance application may also adjustdialect settings. For example, the media guidance application mayautomatically determine that the second metadata for the second searchresult of the first plurality of search results also indicates that thesecond search result corresponds to a first dialect of the secondlanguage, and, in response to determining that the second metadata forthe second search result of the first plurality of search results alsoindicates that the second search result corresponds to the first dialectof the second language, update the user profile to apply a thirdlanguage model to searches, wherein the third language model correspondsto the first dialect of the second language.

In some embodiments, the media guidance application may apply the firstlanguage model, based on the user profile, to the first voice query toidentify the first plurality of search results based on the first voicequery, which includes: determining a first numerical ranking for thefirst resolved word in the first language and a second numerical rankingfor a second resolved word in the first language, based on a likelihoodof usage of the first resolved word in the first language and the secondresolved word in the first language, and determining a first compositescore for the first text query based on summing the first numericalranking and the second numerical ranking.

In some embodiments, the media guidance application may further comprisegenerating for display, on the display device, a first search result forthe first text query, wherein the first search result is ordered amongsearch results based on the first composite ranking.

In some aspects, the media guidance application may identify, usingcontrol circuitry, a first audio segment and a second audio segment,based on audible breaks in the first voice query. For example, if theuser says “Priyanka Chopra” into the user device, the media guidanceapplication may determine that “Priyanka” is the first audio segment and“Chopra” is the second segment, based on the audible break in betweenthe two words.

The media guidance application may determine a first resolved word in afirst language and a second resolved word in the first language bycombining a first set of phonemes to the first audio segment and secondaudio segment, respectively, where the first set of phonemes areselected from phonemes of the first language model. For example,“Priyanka” and “Chopra” may each be processed in the first languagemodel, which could be English in this case, respectively. In someembodiments, there may be multiple potential text queries due to thevarious phenome combinations in each language. Since the input is inspeech form, the media guidance application may analyze the segmentsinto resolved words, which are possible forms of the input query.Accordingly, the media guidance application may interpret “Priyanka” as“Preeyanka”, “Priyonka”, etc.

The media guidance application may determine a first numerical rankingfor the first resolved word in the first language and a second numericalranking for the second resolved word in the first language, based on alikelihood of usage of the first resolved word in the first language andthe second resolved word in the first language. For example, the mediaguidance application may rank the different possibilities for the names“Priyanka” and “Chopra” based on each resolved word's prevalence andlikelihood of usage.

The media guidance application may determine a first composite score forthe first text query based on summing the first numerical ranking andthe second numerical ranking. For example, because the audio segments inthe voice query are related, the media guidance application may combinethe first audio segment resolved words with the second audio segmentresolved words for the first language by taking the ranks of the firstand second resolved words and adding them together, giving a totalcomposite score.

The media guidance application may determine a first resolved word in asecond language and a second resolved word in the second language bycombining a second set of phonemes to the first audio segment and secondaudio segment, respectively, where the second set of phonemes areselected from phonemes of the second language model. For example,“Priyanka Chopra” will be analyzed in a second language model, e.g.,Hindi, and the media guidance application may produce a variety ofresolved words.

The media guidance application may determine a third numerical rankingfor the first resolved word in the second language and a fourthnumerical ranking for the second resolved word in the second languagebased on a likelihood of usage of the first resolved word in the secondlanguage and the second resolved word in the second language. Forexample, the media guidance application may rank the differentpossibilities for the names “Priyanka” and “Chopra” based on eachresolved word's prevalence and likelihood of usage in the Hindilanguage.

The media guidance application may determine a second composite scorefor a second text query based on summing the third numerical ranking andthe fourth numerical ranking. For example, the media guidanceapplication may produce a composite score based on all the Hindilanguage possibilities for combinations of the resolved first and secondwords.

The media guidance application may generate for display, on the displaydevice, a first search result for the first text query and a secondsearch result for the second text query, wherein the first search resultand the second search result are ordered according to the firstcomposite ranking and the second composite ranking, respectively. Forexample, the composite scores of the resolved word combinations areranked and the highest ranked combinations are outputted for display onthe screen for the user to view. For example, the media guidanceapplication may determine the highest ranked results, based on multiplelanguage models, and the media guidance application may display resultsin multiple languages. In some embodiments, the search results generatedfor display are above a certain threshold of prevalence and likelihoodof usage. For example, if the media guidance application determines thatcombinations are unlikely to constitute correct results (based on thelow likelihood of usage), the media guidance application may notgenerate for display a corresponding result.

In some embodiments, the media guidance application may automaticallyupdate a user profile to add a third language model for a third languagefor determining search results, without user input, in response todetermining that metadata for a third search result indicates that thethird search result corresponds to the third language. For example,based on search results corresponding to a third language, the mediaguidance application may add the third language to the user's profileautomatically.

In some embodiments the media guidance application may update the userprofile after receiving a user confirmation to apply the third languagemodel for determining search results, wherein a prompt for receiving theuser confirmation is generated for display in response to determiningthat the metadata for the third search result indicates that the thirdsearch result corresponds to the third language. For example, the mediaguidance application may receive a user input of a different languagethe user wishes to use for a search; in response, the media guidanceapplication may prompt the user to manually enter that language into theuser profile. Furthermore, in some embodiments, the media guidanceapplication may update the user profile by retrieving the languagesetting from the user profile and comparing it to the language settingof the third language. If the media guidance application determines thatthe language differs from the setting, the media guidance applicationmay update the setting, whereas if the language is the same the settingmay be maintained.

In some embodiments the media guidance application may receive a secondvoice query from the user. For example, in some embodiments, the systemwill identify a third and fourth audio segment based on the audiblebreaks in the second voice query and determining, for a third textquery, a first resolved word in the third language and a second resolvedword in the third language model. In such cases, additional compositescores may be determined by the media guidance application and acomposite score of the composite scores may be used by the mediaguidance application for the ranking.

It should be noted that the systems and/or methods described above maybe applied to or used in accordance with other systems, methods, and/orapparatuses in this disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other objects and advantages of the disclosure will beapparent upon consideration of the following detailed description, takenin conjunction with the accompanying drawings, in which like referencecharacters refer to like parts throughout, and in which:

FIG. 1 shows an illustrative example of a display screen generated by amedia guidance application, in accordance with some embodiments of thedisclosure;

FIG. 2 shows another illustrative example of a display screen generatedby a media guidance application, in accordance with some embodiments ofthe disclosure;

FIG. 3 shows a block diagram of an illustrative user equipment device,in accordance with some embodiments of the disclosure;

FIG. 4 shows a block diagram of an illustrative media network, inaccordance with some embodiments of the disclosure;

FIG. 5 is a flowchart of illustrative steps involved in automaticallyupdating language models, in accordance with some embodiments of thedisclosure;

FIG. 6 is a flowchart of illustrative steps providing search resultsfrom searches in multiple languages, in accordance with some embodimentsof the disclosure;

FIG. 7 is a flowchart of illustrative steps for whether to update a userprofile or not, in accordance with some embodiments of the disclosure;and

FIG. 8 is another flowchart of illustrative steps for whether to updatea user profile or not, in accordance with some embodiments of thedisclosure.

DETAILED DESCRIPTION

Systems and methods are described herein for a media guidanceapplication that generates results in multiple languages for searchqueries. Thus, users no longer need to worry that a device may notrecognize a particular language or accent, or that the device may not beable to provide results that are specific to a particular language or toa particular accent. In particular, the media guidance applicationresolves multiple language barriers by taking automatic and manual userlanguage settings and applying those settings to a variety of potentialsearch results.

As referred to herein, a media guidance application is an applicationwhich provides an interface that allows users to efficiently navigatemedia selections and easily identify media that they may desire. A mediaguidance application is referred to herein as an interactive mediaguidance application or, sometimes, a media guidance application or aguidance application. The media guidance application may use languagemodeling in speech recognition, machine translation, part-of-speechtagging, parsing, handwriting recognition, information retrieval, andother applications such as searching through media content in responseto audio queries.

Interactive media guidance applications may take various forms dependingon the media for which they provide guidance. One typical type of mediaguidance application is an interactive television program guide.Interactive television program guides (sometimes referred to aselectronic program guides) are well-known guidance applications that,among other things, allow users to navigate among and locate many typesof media content including conventional television programming (providedvia traditional broadcast, cable, satellite, Internet, or other means),as well as pay-per-view programs, on-demand programs (as invideo-on-demand (VOD) systems), Internet content (e.g., streaming media,downloadable media, Webcasts, etc.), and other types of media or videocontent. Guidance applications also allow users to navigate among andlocate content related to the video content including, for example,video clips, articles, advertisements, chat sessions, games, etc.Guidance applications also allow users to navigate among and locatemultimedia content. The term multimedia is defined herein as media andcontent that utilizes at least two different content forms, such astext, audio, still images, animation, video, and interactivity contentforms. Multimedia content may be recorded and played, displayed oraccessed by information content processing devices, such as computerizedand electronic devices, but can also be part of a live performance. Itshould be understood that the invention embodiments that are discussedin relation to media content are also applicable to other types ofcontent, such as video, audio and/or multimedia.

With the advent of the Internet, mobile computing, and high-speedwireless networks, users are accessing media on personal computers (PCs)and other devices on which they traditionally did not, such as hand-heldcomputers, personal digital assistants (PDAs), mobile telephones, orother mobile devices. On these devices users are able to navigate amongand locate the same media available through a television. Consequently,media guidance is necessary on these devices, as well. The guidanceprovided may be for media content available only through a television,for media content available only through one or more of these devices,or for media content available through both a television and one or moreof these devices. The media guidance applications may be provided ason-line applications (i.e., provided on a website), or as stand-aloneapplications for clients on hand-held computers, PDAs, mobiletelephones, or other mobile devices. The various devices and platformsthat may implement media guidance applications are described in moredetail below.

For example, the media guidance application (e.g., implemented on a userdevice) may automatically apply multiple language models to subsequentsearches in response to determining that a current search producedresults corresponding to those multiple languages. As defined herein, alanguage model is a statistical model used to estimate the likelihood ofdifferent phrases by utilizing a probability distribution over asequence of words. The language model may be a unigram language model,n-gram language model, exponential language model, neural languagemodel, positional language model, etc.

For example, the media guidance application may receive a first voicequery from a user, wherein the user corresponds to a user profile. Asdefined herein a user profile is a set of data stored in memory and itcontains the user's preferences and settings. The media guidanceapplication may be personalized based on a user's preferences andsettings. A personalized media guidance application allows a user tocustomize displays and features to create a personalized “experience”with the media guidance application. This personalized experience may becreated by allowing a user to input these customizations and/or by themedia guidance application monitoring user activity to determine varioususer preferences. Users may access their personalized guidanceapplications by logging in or otherwise identifying themselves to theguidance applications. Customizations of the media guidance applicationmay be made in accordance with a user profile. The customizations mayinclude language settings in which to perform searches (as discussedbelow); varying presentation schemes (e.g., color scheme of displays,font size of text, etc.); aspects of media content listings displayed(e.g., only HDTV programming, user-specified broadcast channels based onfavorite channel selections, re-ordering the display of channels,recommended media content, etc.); desired recording features (e.g.,recording or series recordings for particular users, recording quality,etc.); parental control settings; and other desired customizations.

The media guidance application may allow a user to provide user profileinformation or may automatically compile user profile information. Themedia guidance application may, for example, monitor the media the useraccesses and/or other interactions the user may have with the guidanceapplication. Additionally, the media guidance application may obtain allor part of other user profiles that are related to a particular user(e.g., from other web sites on the Internet the user accesses, such aswww.tvguide.com, from other media guidance applications the useraccesses, from other interactive applications the user accesses, from ahandheld device of the user, etc.), and/or obtain information about theuser from other sources that the media guidance application may access.As a result, a user can be provided with a unified guidance applicationexperience across the user's different devices. This type of userexperience is described in greater detail below in connection with FIG.4. Additional personalized media guidance application features aredescribed in greater detail in Ellis et al., U.S. patent applicationSer. No. 11/179,410, filed Jul. 11, 2005, Boyer et al., U.S. patentapplication Ser. No. 09/437,304, filed Nov. 9, 1999, and Ellis et al.,U.S. patent application Ser. No. 10/105,128, filed Feb. 21, 2002, whichare hereby incorporated by reference herein in their entireties.

The media guidance application may apply a first language model, basedon the user profile, to the first voice query to identify a firstplurality of search results based on the first voice query, wherein thefirst language model corresponds to a first language (e.g., English).The media guidance application may automatically determine that thefirst search result corresponds to a first language (e.g., English) andthat a second search result corresponds to a second language (e.g.,Hindi). In response to determining that the second search resultcorresponds to the second language, the media guidance application mayupdate the user profile to apply a second language model to searches,wherein the second language model corresponds to the second language.The media guidance application may then receive a second voice queryfrom the user, apply both the first language model and the secondlanguage model, based on the user profile, to the second voice query toidentify a second plurality of search results based on the second voicequery, and generate for display, on the display device, the secondplurality of search results. Thus, the media guidance applicationautomatically applies multiple language models to subsequent searches inresponse to determining that a current search produced resultscorresponding to those multiple languages.

In another example, the media guidance application may rank searchresults based on how well different interpretations of a voice querymatch different language models, not simply how popular a given searchresult is in each language. The media guidance application (e.g.,implemented on a user device) may receive an audible input voice queryfrom the user, whose user profile is stored in memory. The mediaguidance application may receive a user input indicating a first andsecond language model, which dictate the search results. In response toreceiving the user input, the media guidance application may parse thatinput into a first and second audio segment, based on audible breaksfrom the user. The media guidance application may then determine a firsttext query based off the first language and subsequently rank that textquery. The media guidance application may determine a second text querybased on the second language and rank that text query as well. The mediaguidance application may then generate for display a first search resultfor the first text query and a second search result for the second textquery, wherein the first search result and the second search result areordered according to the first composite ranking and the secondcomposite ranking, respectively. Thus, the media guidance applicationprovides search results rank based on how well different textualinterpretations of a voice query match different language models, notsimply how popular a given search result is in each language.

For example, if the audible input query from the user is “SophiaVergara” and the first language model is English and the second languagemodel is Spanish, then the media guidance application will generate afirst search result in English and a second search result in Spanish.The order of the various “Sophia Vergara” search results will be basedon a composite ranking of each result.

In some embodiments, the media guidance application may receive, via anaudio input device, a first voice query from a user, wherein the usercorresponds to a user profile that is stored in memory. For example, themedia guidance application may store a user profile that has informationon the user's tendencies and preferences, in particular the languagesused and/or preferred by the user.

The media guidance application may apply a first language model, basedon the user profile, to the first voice query to identify a firstplurality of search results based on the first voice query, wherein thefirst language model corresponds to a first language. For example, themedia guidance application may determine that the first language settingcorresponds to English (e.g., either by default or based on a prior userselection). Thus, the media guidance application may use an Englishlanguage model for an initial search. To apply a language model, themedia guidance application, in some embodiments, may parse the firstvoice query to identify a first audio segment and a second audio segmentbased on audible breaks in the first voice query, comparing the firstaudio segment to the first set of phonemes to determine a first resolvedword corresponding to the first audio segment, and determine a firsttext query based on the first resolved word.

As defined herein, a phoneme is any of several distinct units of soundin a certain language that distinguish words from other words. Forexample, p, b, and d in the English words, pad, pat, and bad, arephonemes that distinguish one word from another. For example, if thefirst voice query was “Vincent Chase”, the media guidance applicationmay identify a first audio segment as “Vincent” and the second audiosegment “Chase”. The media guidance application may, additionally oralternatively, indicate a first set of phonemes for a first languagemodel used to identify the first plurality of search results. Forexample, any given word in a certain language is a combination ofphonemes. The media guidance application may use the variouscombinations of phonemes of a certain language to resolve the audiosegments into text queries.

The media guidance application may generate for display, on a displaydevice, the first plurality of search results. For example, the userwill be able to view a variety of results based on his or her voiceinput. The results could be several items, such as movies related to agiven actor.

The media guidance application may retrieve, from the memory, firstmetadata for the first search result and second metadata for the secondsearch result. The media guidance application may then automaticallydetermine that the first metadata for the first search result of thefirst plurality of search results indicates that the first search resultcorresponds to a first language, and that the second metadata for thesecond search result of the first plurality of search results indicatesthat the second search result corresponds to a second language. Forexample, the media guidance application may process metadatacorresponding to each search result to determine a language or regiontypically associated with the search result. For example, in response tometadata detected indicating that a movie was filmed in France, producedby the French film industry (e.g., in Paris), or featuring Frenchactors, the media guidance application to determine a languagecorresponding to that metadata other than English.

The media guidance application may, in response to determining that thesecond metadata for the second search result of the first plurality ofsearch results indicates that the second search result corresponds tothe second language, update the user profile to apply a second languagemodel to searches, wherein the second language model corresponds to thesecond language. For example, in response to determining that the secondsearch result corresponds to French, the media guidance application mayupdate the user profile to perform searches in both English and French.In some embodiments, the media guidance application updating the userprofile includes: retrieving a language setting from the user profile,comparing a language corresponding to the language setting to the secondlanguage, and determining that the second language does not correspondto the language corresponding to the language setting. Alternatively, ifthe media guidance application determines that the first metadata forthe first search result of the first plurality of search resultsindicates that the first search result corresponds to the firstlanguage, the media guidance application maintains the language setting(e.g., listing only English) in the user profile.

The media guidance application may receive a second voice query from theuser. For example, now that the user profile has been updated with afirst and second language model, the application can analyze anadditional voice query under the new language preferences (e.g., Englishand French). The media guidance application may apply both the firstlanguage model and the second language model, based on the updated userprofile, to the second voice query to identify a second plurality ofsearch results based on the second voice query.

The media guidance application may generate for display, on the displaydevice, the second plurality of search results. For example, the secondplurality of search results may include search results identified afterprocessing the voice query in both English and French.

In some embodiments, the media guidance application may update the userprofile after receiving a user confirmation to apply the second languagemodel to searches, wherein a prompt for receiving the user confirmationis generated for display in response to determining that the secondmetadata for the second search result of the first plurality of searchresults indicates that the second search result corresponds to a secondlanguage. For example, in response to determining that additionallanguages may be used (or may be desired by a user), the media guidanceapplication may prompt the user for confirmation before updating thelanguage settings in the user profile.

In some embodiments, the media guidance application may also adjustdialect settings. For example, the media guidance application mayautomatically determine that the second metadata for the second searchresult of the first plurality of search results also indicates that thesecond search result corresponds to a first dialect of the secondlanguage and, in response to determining that the second metadata forthe second search result of the first plurality of search results alsoindicates that the second search result corresponds to the first dialectof the second language, update the user profile to apply a thirdlanguage model to searches, wherein the third language model correspondsto the first dialect of the second language.

In some embodiment, the media guidance application may apply the firstlanguage model, based on the user profile, to the first voice query toidentify the first plurality of search results based on the first voicequery, which includes: determining a first numerical ranking for thefirst resolved word in the first language and a second numerical rankingfor a second resolved word in the first language based on a likelihoodof usage of the first resolved word in the first language and the secondresolved word in the first language and determining a first compositescore for the first text query based on summing the first numericalranking and the second numerical ranking.

For example, the media guidance application may apply the Englishlanguage model to a voice query of “Edward Norton”. This search querymay have a first resolved word and a second resolved word rankedrespectively.

In some embodiments, the media guidance application may further comprisegenerating for display, on the display device, a first search result forthe first text query, wherein the first search result is ordered amongsearch results based on the first composite ranking.

In some embodiments, the media guidance application may identify, usingcontrol circuitry, a first audio segment and a second audio segment,based on audible breaks in the first voice query. For example, if theuser says “Salma Hayek” into the user device, the media guidanceapplication may determine that “Salma” is the first audio segment and“Hayek” is the second segment, based on the audible break in between thetwo words.

The media guidance application may determine a first resolved word in afirst language and a second resolved word in the first language bycombining a first set of phonemes to the first audio segment and secondaudio segment, respectively, where the first set of phonemes areselected from phonemes of the first language model. For example, “Salma”and “Hayek” may each be processed in the first language model, whichcould be English in this case, respectively. In some embodiments, theremay be multiple potential text queries due to the various phenomecombinations in each language. Since the input is in speech form, themedia guidance application may analyze the segments into resolved words,which are possible forms of the input query. Accordingly, the mediaguidance application may interpret “Salma” as “Sulma”, “Sallma”, etc.

The media guidance application may determine a first numerical rankingfor the first resolved word in the first language and a second numericalranking for the second resolved word in the first language based on alikelihood of usage of the first resolved word in the first language andthe second resolved word in the first language. For example, the mediaguidance application may rank the different possibilities for the names“Salma” and “Hayek” based on each resolved word's prevalence andlikelihood of usage.

The media guidance application may determine a first composite score forthe first text query based on summing the first numerical ranking andthe second numerical ranking. For example, because the audio segments inthe voice query are related, the media guidance application may combinethe first audio segment resolved words with the second audio segmentresolved words for the first language by taking the ranks of the firstand second resolved words and adding them together, giving a totalcomposite score. For example, if “Salma” and “Hayek” was found to berelated, the media guidance application may combine the rank of “Salma”and “Hayek” to get the composite score.

The media guidance application may determine a first resolved word in asecond language and a second resolved word in the second language bycombining a second set of phonemes to the first audio segment and secondaudio segment, respectively, where the second set of phonemes areselected from phonemes of the second language model. For example, “SalmaHayek” may be analyzed in a second language model, e.g., Spanish, andthe media guidance application may produce a variety of resolved words.

The media guidance application may determine a third numerical rankingfor the first resolved word in the second language and a fourthnumerical ranking for the second resolved word in the second languagebased on a likelihood of usage of the first resolved word in the secondlanguage and the second resolved word in the second language. Forexample, the media guidance application may rank the differentpossibilities for the names “Salma” and “Hayek” based on each resolvedword's prevalence and likelihood of usage in the Spanish language.

The media guidance application may determine a second composite scorefor a second text query based on summing the third numerical ranking andthe fourth numerical ranking. For example, the media guidanceapplication may produce a composite score based on all the Spanishlanguage possibilities for combinations of the resolved first and secondwords.

The media guidance application may generate for display, on the displaydevice, a first search result for the first text query and a secondsearch result for the second text query, wherein the first search resultand the second search result are ordered according to the firstcomposite ranking and the second composite ranking, respectively. Forexample, the composite scores of the resolved word combinations areranked and the highest ranked combinations are outputted for display onthe screen for the user to view. For example, the media guidanceapplication may determine the highest ranked results, based on multiplelanguage models, and the media guidance application may display resultsin multiple languages. In some embodiments, the search results generatedfor display are above a certain threshold of prevalence and likelihoodof usage. For example, if the media guidance application determines thatcombinations are unlikely to constitute correct results (based on thelow likelihood of usage), the media guidance application may notgenerate for display corresponding results.

In some embodiments, the media guidance application may automaticallyupdate a user profile to add a third language model for a third languagefor determining search results, without user input, in response todetermining that metadata for a third search result indicates that thethird search result corresponds to the third language. For example,based on search results corresponding to a third language, the mediaguidance application may add the third language to the user's profileautomatically. For example, a user's speech may suggest that in additionto having English and Spanish in his or her profile, French may beapplied too. French would be the third language.

In some embodiments, the media guidance application may update the userprofile after receiving a user confirmation to apply the third languagemodel for determining search results, wherein a prompt for receiving theuser confirmation is generated for display in response to determiningthat the metadata for the third search result indicates that the thirdsearch result corresponds to the third language. For example, the mediaguidance application may receive a user input of a different languagethe user wishes to use for a search, or the media guidance applicationmay prompt the user to manually enter that language into the userprofile. Furthermore, in some embodiments, the media guidanceapplication may update the user profile by retrieving the languagesetting from the user profile and comparing it to the language settingof the third language. If the media guidance application determines thatthe language differs from the setting, the media guidance applicationmay update the setting, whereas if the language is the same, the settingmay be maintained.

In some embodiments, the media guidance application may receive a secondvoice query from the user. For example, in some embodiments, the systemwill identify a third and fourth audio segment based on the audiblebreaks in the second voice query and determining, for a third textquery, a first resolved word in the third language model and a secondresolved word in the third language model. In such cases, additionalcomposite scores may be determined by the media guidance application anda composite score of the composite scores may be used by the mediaguidance application for the ranking.

For example, the user may provide a second voice query such as “JeanReno”. The media guidance application may apply the third languagemodel, such as French, to the query.

One of the functions of the media guidance application is to providemedia listings and media information to users. FIGS. 1-2 showillustrative display screens that may be used to provide media guidance,and in particular media listings. The display screens shown in FIGS. 1-2may be implemented on any suitable device or platform. While thedisplays of FIGS. 1-2 are illustrated as full screen displays, they mayalso be fully or partially overlaid over media content being displayed.A user may indicate a desire to access media information by selecting aselectable option provided in a display screen (e.g., a menu option, alistings option, an icon, a hyperlink, etc.) or pressing a dedicatedbutton (e.g., a GUIDE button) on a remote control or other user inputinterface or device. In response to the user's indication, the mediaguidance application may provide a display screen with media informationorganized in one of several ways, such as by time and channel in a grid,by time, by channel, by media type, by category (e.g., movies, sports,news, children, or other categories of programming), or otherpredefined, user-defined, or other organization criteria.

FIG. 1 shows illustrative grid program listings display 100 arranged bytime and channel that also enables access to different types of mediacontent in a single display. Display 100 may include grid 102 with: (1)a column of channel/media type identifiers 104, where each channel/mediatype identifier (which is a cell in the column) identifies a differentchannel or media type available; and (2) a row of time identifiers 106,where each time identifier (which is a cell in the row) identifies atime block of programming. Grid 102 also includes cells of programlistings, such as program listing 108, where each listing provides thetitle of the program provided on the listing's associated channel andtime. With a user input device, a user can select program listings bymoving highlight region 110. Information relating to the program listingselected by highlight region 110 may be provided in program informationregion 112. Region 112 may include, for example, the program title, theprogram description, the time the program is provided (if applicable),the channel the program is on (if applicable), the program's rating, andother desired information.

In addition to providing access to linear programming provided accordingto a schedule, the media guidance application also provides access tonon-linear programming which is not provided according to a schedule.Non-linear programming may include content from different media sourcesincluding on-demand media content (e.g., VOD), Internet content (e.g.,streaming media, downloadable media, etc.), locally stored media content(e.g., video content stored on a digital video recorder (DVR), digitalvideo disc (DVD), video cassette, compact disc (CD), etc.), or othertime-insensitive media content. On-demand content may include bothmovies and original media content provided by a particular mediaprovider (e.g., HBO On Demand providing “The Sopranos” and “Curb YourEnthusiasm”). HBO ON DEMAND is a service mark owned by Time WarnerCompany L.P. et al. and THE SOPRANOS and CURB YOUR ENTHUSIASM aretrademarks owned by the Home Box Office, Inc. Internet content mayinclude web events, such as a chat session or Webcast, or contentavailable on-demand as streaming media or downloadable media through anInternet web site or other Internet access (e.g. FTP).

Grid 102 may provide listings for non-linear programming includingon-demand listing 114, recorded media listing 116, and Internet contentlisting 118. A display combining listings for content from differenttypes of media sources is sometimes referred to as a “mixed-media”display. The various permutations of the types of listings that may bedisplayed that are different than display 100 may be based on userselection or guidance application definition (e.g., a display of onlyrecorded and broadcast listings, only on-demand and broadcast listings,etc.). As illustrated, listings 114, 116, and 118 are shown as spanningthe entire time block displayed in grid 102 to indicate that selectionof these listings may provide access to a display dedicated to on-demandlistings, recorded listings, or Internet listings, respectively. Inother embodiments, listings for these media types may be includeddirectly in grid 102. Additional listings may be displayed in responseto the user selecting one of the navigational icons 120. (Pressing anarrow key on a user input device may affect the display in a similarmanner as selecting navigational icons 120.)

Display 100 may also include video region 122, advertisement 124, andoptions region 126. Video region 122 may allow the user to view and/orpreview programs that are currently available, will be available, orwere available to the user. The content of video region 122 maycorrespond to, or be independent from, one of the listings displayed ingrid 102. Grid displays including a video region are sometimes referredto as picture-in-guide (PIG) displays. PIG displays and theirfunctionalities are described in greater detail in Satterfield et al.U.S. Pat. No. 6,564,378, issued May 13, 2003 and Yuen et al. U.S. Pat.No. 6,239,794, issued May 29, 2001, which are hereby incorporated byreference herein in their entireties. PIG displays may be included inother media guidance application display screens of the presentinvention.

Advertisement 124 may provide an advertisement for media content that,depending on a viewer's access rights (e.g., for subscriptionprogramming), is currently available for viewing, will be available forviewing in the future, or may never become available for viewing, andmay correspond to or be unrelated to one or more of the media listingsin grid 102. Advertisement 124 may also be for products or servicesrelated or unrelated to the media content displayed in grid 102.Advertisement 124 may be selectable and provide further informationabout media content, provide information about a product or a service,enable purchasing of media content, a product, or a service, providemedia content relating to the advertisement, etc. Advertisement 124 maybe targeted based on a user's profile/preferences, monitored useractivity, the type of display provided, or on other suitable targetedadvertisement bases.

While advertisement 124 is shown as rectangular or banner shaped,advertisements may be provided in any suitable size, shape, and locationin a guidance application display. For example, advertisement 124 may beprovided as a rectangular shape that is horizontally adjacent to grid102. This is sometimes referred to as a panel advertisement. Inaddition, advertisements may be overlaid over media content or aguidance application display or embedded within a display.Advertisements may also include text, images, rotating images, videoclips, or other types of media content. Advertisements may be stored inthe user equipment with the guidance application, in a databaseconnected to the user equipment, in a remote location (includingstreaming media servers), or on other storage means or a combination ofthese locations. Providing advertisements in a media guidanceapplication is discussed in greater detail in, for example, Knudson etal., U.S. patent application Ser. No. 10/347,673, filed Jan. 17, 2003,Ward, III et al. U.S. Pat. No. 6,756,997, issued Jun. 29, 2004, andSchein et al. U.S. Pat. No. 6,388,714, issued May 14, 2002, which arehereby incorporated by reference herein in their entireties. It will beappreciated that advertisements may be included in other media guidanceapplication display screens of the present invention.

Options region 126 may allow the user to access different types of mediacontent, media guidance application displays, and/or media guidanceapplication features. Options region 126 may be part of display 100 (andother display screens of the present invention), or may be invoked by auser by selecting an on-screen option or pressing a dedicated orassignable button on a user input device. The selectable options withinoptions region 126 may concern features related to program listings ingrid 102 or may include options available from a main menu display.Features related to program listings may include searching for other airtimes or ways of receiving a program, recording a program, enablingseries recording of a program, setting program and/or channel as afavorite, purchasing a program, or other features. Options availablefrom a main menu display may include search options, VOD options,parental control options, access to various types of listing displays,subscribe to a premium service, edit a user's profile, access a browseoverlay, or other options.

Another display arrangement for providing media guidance is shown inFIG. 2. Video mosaic display 200 includes selectable options 202 formedia content information organized based on media type, genre, and/orother organization criteria. In display 200, television listings option204 is selected, thus providing listings 206, 208, 210, and 212 asbroadcast program listings. Unlike the listings from FIG. 1, thelistings in display 200 are not limited to simple text (e.g., theprogram title) and icons to describe media. Rather, in display 200 thelistings may provide graphical images including cover art, still imagesfrom the media content, video clip previews, live video from the mediacontent, or other types of media that indicate to a user the mediacontent being described by the listing. Each of the graphical listingsmay also be accompanied by text to provide further information about themedia content associated with the listing. For example, listing 208 mayinclude more than one portion, including media portion 214 and textportion 216. Media portion 214 and/or text portion 216 may be selectableto view video in full-screen or to view program listings related to thevideo displayed in media portion 214 (e.g., to view listings for thechannel that the video is displayed on).

The listings in display 200 are of different sizes (i.e., listing 206 islarger than listings 208, 210, and 212), but if desired, all thelistings may be the same size. Listings may be of different sizes orgraphically accentuated to indicate degrees of interest to the user orto emphasize certain content, as desired by the media provider or basedon user preferences. Various systems and methods for graphicallyaccentuating media listings are discussed in, for example, Yates, U.S.patent application Ser. No. 11/324,202, filed Dec. 29, 2005, which ishereby incorporated by reference herein in its entirety.

Users may access media content and the media guidance application (andits display screens described above and below) from one or more of theiruser equipment devices. FIG. 3 shows a generalized embodiment ofillustrative user equipment device 300. More specific implementations ofuser equipment devices are discussed below in connection with FIG. 4.User equipment device 300 may receive media content and data viainput/output (hereinafter “I/O”) path 302. I/O path 302 may providemedia content (e.g., broadcast programming, on-demand programming,Internet content, and other video or audio) and data to controlcircuitry 304, which includes processing circuitry 306 and storage 308.Control circuitry 304 may be used to send and receive commands,requests, and other suitable data using I/O path 302. I/O path 302 mayconnect control circuitry 304 (and specifically processing circuitry306) to one or more communications paths (described below). I/Ofunctions may be provided by one or more of these communications paths,but are shown as a single path in FIG. 3 to avoid overcomplicating thedrawing.

Control circuitry 304 may be based on any suitable processing circuitry306 such as processing circuitry based on one or more microprocessors,microcontrollers, digital signal processors, programmable logic devices,etc. In some embodiments, control circuitry 304 executes instructionsfor a media guidance application stored in memory (i.e., storage 308).In client-server based embodiments, control circuitry 304 may includecommunications circuitry suitable for communicating with a guidanceapplication server or other networks or servers. Communicationscircuitry may include a cable modem, an integrated services digitalnetwork (ISDN) modem, a digital subscriber line (DSL) modem, a telephonemodem, or a wireless modem for communications with other equipment. Suchcommunications may involve the Internet or any other suitablecommunications networks or paths (which is described in more detail inconnection with FIG. 4). In addition, communications circuitry mayinclude circuitry that enables peer-to-peer communication of userequipment devices, or communication of user equipment devices inlocations remote from each other (described in more detail below).

Memory (e.g., random-access memory, read-only memory, or any othersuitable memory), hard drives, optical drives, or any other suitablefixed or removable storage devices (e.g., DVD recorder, CD recorder,video cassette recorder, or other suitable recording device) may beprovided as storage 308 that is part of control circuitry 304. Storage308 may include one or more of the above types of storage devices. Forexample, user equipment device 300 may include a hard drive for a DVR(sometimes called a personal video recorder, or PVR) and a DVD recorderas a secondary storage device. Storage 308 may be used to store varioustypes of media described herein and guidance application data, includingprogram information, guidance application settings, user preferences orprofile information, or other data used in operating the guidanceapplication. Nonvolatile memory may also be used (e.g., to launch aboot-up routine and other instructions).

Control circuitry 304 may include video generating circuitry and tuningcircuitry, such as one or more analog tuners, one or more MPEG-2decoders or other digital decoding circuitry, high-definition tuners, orany other suitable tuning or video circuits or combinations of suchcircuits. Encoding circuitry (e.g., for converting over-the-air, analog,or digital signals to MPEG signals for storage) may also be provided.Control circuitry 304 may also include scaler circuitry for upconvertingand downconverting media into the preferred output format of the userequipment 300. Circuitry 304 may also include digital-to-analogconverter circuitry and analog-to-digital converter circuitry forconverting between digital and analog signals. The tuning and encodingcircuitry may be used by the user equipment to receive and to display,to play, or to record media content. The tuning and encoding circuitrymay also be used to receive guidance data. The circuitry describedherein, including for example, the tuning, video generating, encoding,decoding, scaler, and analog/digital circuitry, may be implemented usingsoftware running on one or more general purpose or specializedprocessors. Multiple tuners may be provided to handle simultaneoustuning functions (e.g., watch and record functions, picture-in-picture(PIP) functions, multiple-tuner recording, etc.). If storage 308 isprovided as a separate device from user equipment 300, the tuning andencoding circuitry (including multiple tuners) may be associated withstorage 308.

A user may control the control circuitry 304 using user input interface310. User input interface 310 may be any suitable user interface, suchas a remote control, mouse, trackball, keypad, keyboard, touch screen,touch pad, stylus input, joystick, voice recognition interface, or otheruser input interfaces. Display 312 may be provided as a stand-alonedevice or integrated with other elements of user equipment device 300.Display 312 may be one or more of a monitor, a television, a liquidcrystal display (LCD) for a mobile device, or any other suitableequipment for displaying visual images. In some embodiments, display 312may be HDTV-capable. Speakers 314 may be provided as integrated withother elements of user equipment device 300 or may be stand-alone units.The audio component of videos and other media content displayed ondisplay 312 may be played through speakers 314. In some embodiments, theaudio may be distributed to a receiver (not shown), which processes andoutputs the audio via speakers 314.

The guidance application may be implemented using any suitablearchitecture. For example, it may be a stand-alone application whollyimplemented on user equipment device 300. In such an approach,instructions of the application are stored locally, and data for use bythe application is downloaded on a periodic basis (e.g., from the VBI ofa television channel, from an out-of-band feed, or using anothersuitable approach). In another embodiment, the media guidanceapplication is a client-server based application. Data for use by athick or thin client implemented on user equipment device 300 isretrieved on-demand by issuing requests to a server remote to the userequipment device 300. In one example of a client-server based guidanceapplication, control circuitry 304 runs a web browser that interpretsweb pages provided by a remote server.

In yet other embodiments, the media guidance application is downloadedand interpreted or otherwise run by an interpreter or virtual machine(run by control circuitry 304). In some embodiments, the guidanceapplication may be encoded in the ETV Binary Interchange Format (EBIF),received by control circuitry 304 as part of a suitable feed, andinterpreted by a user agent running on control circuitry 304. Forexample, the guidance application may be a EBIF widget. In otherembodiments, the guidance application may be defined by a series ofJAVA-based files that are received and run by a local virtual machine orother suitable middleware executed by control circuitry 304. In some ofsuch embodiments (e.g., those employing MPEG-2 or other digital mediaencoding schemes), the guidance application may be, for example, encodedand transmitted in an MPEG-2 object carousel with the MPEG audio andvideo packets of a program.

User equipment device 300 of FIG. 3 can be implemented in system 400 ofFIG. 4 as user television equipment 402, user computer equipment 404,wireless user communications device 406, or any other type of userequipment suitable for accessing media, such as a non-portable gamingmachine. For simplicity, these devices may be referred to hereincollectively as user equipment or user equipment devices. User equipmentdevices, on which a media guidance application is implemented, mayfunction as a standalone device or may be part of a network of devices.Various network configurations of devices may be implemented and arediscussed in more detail below.

User television equipment 402 may include a set-top box, an integratedreceiver decoder (IRD) for handling satellite television, a televisionset, a digital storage device, a DVD recorder, a video-cassette recorder(VCR), a local media server, or other user television equipment. One ormore of these devices may be integrated to be a single device, ifdesired. User computer equipment 404 may include a PC, a laptop, atablet, a WebTV box, a personal computer television (PC/TV), a PC mediaserver, a PC media center, or other user computer equipment. WEBTV is atrademark owned by Microsoft Corp. Wireless user communications device406 may include PDAs, a mobile telephone, a portable video player, aportable music player, a portable gaming machine, or other wirelessdevices.

It should be noted that with the advent of television tuner cards forPC's, WebTV, and the integration of video into other user equipmentdevices, the lines have become blurred when trying to classify a deviceas one of the above devices. In fact, each of user television equipment402, user computer equipment 404, and wireless user communicationsdevice 406 may utilize at least some of the system features describedabove in connection with FIG. 3 and, as a result, include flexibilitywith respect to the type of media content available on the device. Forexample, user television equipment 402 may be Internet-enabled allowingfor access to Internet content, while user computer equipment 404 mayinclude a tuner allowing for access to television programming. The mediaguidance application may also have the same layout on the variousdifferent types of user equipment or may be tailored to the displaycapabilities of the user equipment. For example, on user computerequipment, the guidance application may be provided as a web siteaccessed by a web browser. In another example, the guidance applicationmay be scaled down for wireless user communications devices.

In system 400, there is typically more than one of each type of userequipment device but only one of each is shown in FIG. 4 to avoidovercomplicating the drawing. In addition, each user may utilize morethan one type of user equipment device (e.g., a user may have atelevision set and a computer) and also more than one of each type ofuser equipment device (e.g., a user may have a PDA and a mobiletelephone and/or multiple television sets).

The user may also set various settings to maintain consistent mediaguidance application settings across in-home devices and remote devices.Settings include those described herein, as well as channel and programfavorites, programming preferences that the guidance applicationutilizes to make programming recommendations, display preferences, andother desirable guidance settings. For example, if a user sets a channelas a favorite on, for example, the web site www.tvguide.com on theirpersonal computer at their office, the same channel would appear as afavorite on the user's in-home devices (e.g., user television equipmentand user computer equipment) as well as the user's mobile devices, ifdesired. Therefore, changes made on one user equipment device can changethe guidance experience on another user equipment device, regardless ofwhether they are the same or a different type of user equipment device.In addition, the changes made may be based on settings input by a user,as well as user activity monitored by the guidance application.

The user equipment devices may be coupled to communications network 414.Namely, user television equipment 402, user computer equipment 404, andwireless user communications device 406 are coupled to communicationsnetwork 414 via communications paths 408, 410, and 412, respectively.Communications network 414 may be one or more networks including theInternet, a mobile phone network, mobile device (e.g., Blackberry)network, cable network, public switched telephone network, or othertypes of communications network or combinations of communicationsnetworks. BLACKBERRY is a service mark owned by Research In MotionLimited Corp. Paths 408, 410, and 412 may separately or together includeone or more communications paths, such as, a satellite path, afiber-optic path, a cable path, a path that supports Internetcommunications (e.g., IPTV), free-space connections (e.g., for broadcastor other wireless signals), or any other suitable wired or wirelesscommunications path or combination of such paths. Path 412 is drawn withdotted lines to indicate that in the exemplary embodiment shown in FIG.4 it is a wireless path and paths 408 and 410 are drawn as solid linesto indicate they are wired paths (although these paths may be wirelesspaths, if desired).

Communications with the user equipment devices may be provided by one ormore of these communications paths, but are shown as a single path inFIG. 4 to avoid overcomplicating the drawing.

Although communications paths are not drawn between user equipmentdevices, these devices may communicate directly with each other viacommunication paths, such as those described above in connection withpaths 408, 410, and 412, as well other short-range point-to-pointcommunication paths, such as USB cables, IEEE 1394 cables, wirelesspaths (e.g., Bluetooth, infrared, IEEE 802-11x, etc.), or othershort-range communication via wired or wireless paths. BLUETOOTH is acertification mark owned by Bluetooth SIG, INC. The user equipmentdevices may also communicate with each other directly through anindirect path via communications network 414.

System 400 includes media content source 416 and media guidance datasource 418 coupled to communications network 414 via communication paths420 and 422, respectively. Paths 420 and 422 may include any of thecommunication paths described above in connection with paths 408, 410,and 412.

Communications with the media content source 416 and media guidance datasource 418 may be exchanged over one or more communications paths, butare shown as a single path in FIG. 4 to avoid overcomplicating thedrawing. In addition, there may be more than one of each of mediacontent source 416 and media guidance data source 418, but only one ofeach is shown in FIG. 4 to avoid overcomplicating the drawing. (Thedifferent types of each of these sources are discussed below.) Ifdesired, media content source 416 and media guidance data source 418 maybe integrated as one source device. Although communications betweensources 416 and 418 with user equipment devices 402, 404, and 406 areshown as through communications network 414, in some embodiments,sources 416 and 418 may communicate directly with user equipment devices402, 404, and 406 via communication paths (not shown) such as thosedescribed above in connection with paths 408, 410, and 412.

Media content source 416 may include one or more types of mediadistribution equipment including a television distribution facility,cable system headend, satellite distribution facility, programmingsources (e.g., television broadcasters, such as NBC, ABC, HBO, etc.),intermediate distribution facilities and/or servers, Internet providers,on-demand media servers, and other media content providers. NBC is atrademark owned by the National Broadcasting Company, Inc., ABC is atrademark owned by the ABC, INC., and HBO is a trademark owned by theHome Box Office, Inc. Media content source 416 may be the originator ofmedia content (e.g., a television broadcaster, a Webcast provider, etc.)or may not be the originator of media content (e.g., an on-demand mediacontent provider, an Internet provider of video content of broadcastprograms for downloading, etc.). Media content source 416 may includecable sources, satellite providers, on-demand providers, Internetproviders, or other providers of media content. Media content source 416may also include a remote media server used to store different types ofmedia content (including video content selected by a user), in alocation remote from any of the user equipment devices. Systems andmethods for remote storage of media content, and providing remotelystored media content to user equipment are discussed in greater detailin connection with Ellis et al., U.S. patent application Ser. No.09/332,244, filed Jun. 11, 1999, which is hereby incorporated byreference herein in its entirety.

Media guidance data source 418 may provide media guidance data, such asmedia listings, media-related information (e.g., broadcast times,broadcast channels, media titles, media descriptions, ratingsinformation (e.g., parental control ratings, critic's ratings, etc.),genre or category information, actor information, logo data forbroadcasters' or providers' logos, etc.), media format (e.g., standarddefinition, high definition, etc.), advertisement information (e.g.,text, images, media clips, etc.), on-demand information, and any othertype of guidance data that is helpful for a user to navigate among andlocate desired media selections.

Media guidance application data may be provided to the user equipmentdevices using any suitable approach. In some embodiments, the guidanceapplication may be a stand-alone interactive television program guidethat receives program guide data via a data feed (e.g., a continuousfeed, trickle feed, or data in the vertical blanking interval of achannel). Program schedule data and other guidance data may be providedto the user equipment on a television channel sideband, in the verticalblanking interval of a television channel, using an in-band digitalsignal, using an out-of-band digital signal, or by any other suitabledata transmission technique. Program schedule data and other guidancedata may be provided to user equipment on multiple analog or digitaltelevision channels. Program schedule data and other guidance data maybe provided to the user equipment with any suitable frequency (e.g.,continuously, daily, a user-specified period of time, a system-specifiedperiod of time, in response to a request from user equipment, etc.). Insome approaches, guidance data from media guidance data source 418 maybe provided to users' equipment using a client-server approach. Forexample, a guidance application client residing on the user's equipmentmay initiate sessions with source 418 to obtain guidance data whenneeded. Media guidance data source 418 may provide user equipmentdevices 402, 404, and 406 the media guidance application itself orsoftware updates for the media guidance application.

Media guidance applications may be, for example, stand-aloneapplications implemented on user equipment devices. In otherembodiments, media guidance applications may be client-serverapplications where only the client resides on the user equipment device.For example, media guidance applications may be implemented partially asa client application on control circuitry 304 of user equipment device300 and partially on a remote server as a server application (e.g.,media guidance data source 418). The guidance application displays maybe generated by the media guidance data source 418 and transmitted tothe user equipment devices. The media guidance data source 418 may alsotransmit data for storage on the user equipment, which then generatesthe guidance application displays based on instructions processed bycontrol circuitry.

Media guidance system 400 is intended to illustrate a number ofapproaches, or network configurations, by which user equipment devicesand sources of media content and guidance data may communicate with eachother for the purpose of accessing media and providing media guidance.The present invention may be applied in any one or a subset of theseapproaches, or in a system employing other approaches for deliveringmedia and providing media guidance. The following three approachesprovide specific illustrations of the generalized example of FIG. 4.

In one approach, user equipment devices may communicate with each otherwithin a home network. User equipment devices can communicate with eachother directly via short-range point-to-point communication schemesdescribe above, via indirect paths through a hub or other similar deviceprovided on a home network, or via communications network 414. Each ofthe multiple individuals in a single home may operate different userequipment devices on the home network. As a result, it may be desirablefor various media guidance information or settings to be communicatedbetween the different user equipment devices. For example, it may bedesirable for users to maintain consistent media guidance applicationsettings on different user equipment devices within a home network, asdescribed in greater detail in Ellis et al., U.S. patent applicationSer. No. 11/179,410, filed Jul. 11, 2005. Different types of userequipment devices in a home network may also communicate with each otherto transmit media content. For example, a user may transmit mediacontent from user computer equipment to a portable video player orportable music player.

In a second approach, users may have multiple types of user equipment bywhich they access media content and obtain media guidance. For example,some users may have home networks that are accessed by in-home andmobile devices. Users may control in-home devices via a media guidanceapplication implemented on a remote device. For example, users mayaccess an online media guidance application on a web-site via a personalcomputer at their office, or a mobile device such as a PDA orweb-enabled mobile telephone. The user may set various settings (e.g.,recordings, reminders, or other settings) on the online guidanceapplication to control the user's in-home equipment. The online guidemay control the user's equipment directly, or by communicating with amedia guidance application on the user's in-home equipment. Varioussystems and methods for user equipment devices communicating, where theuser equipment devices are in locations remote from each other, isdiscussed in, for example, Ellis et al., U.S. patent application Ser.No. 10/927,814, filed Aug. 26, 2004, which is hereby incorporated byreference herein in its entirety.

In a third approach, users of user equipment devices inside and outsidea home can use their media guidance application to communicate directlywith media content source 416 to access media content.

Specifically, within a home, users of user television equipment 404 anduser computer equipment 406 may access the media guidance application tonavigate among and locate desirable media content. Users may also accessthe media guidance application outside of the home using wireless usercommunications devices 406 to navigate among and locate desirable mediacontent.

It will be appreciated that while the discussion of media content hasfocused on video content, the principles of media guidance can beapplied to other types of media content, such as music, images, etc.

FIG. 5 is a flowchart of illustrative steps for automatically updatinglanguage models in a user profile. It should be noted that process 500or any step thereof could be performed on, or provided by, any of thedevices shown in FIGS. 3-4. For example, process 500 may be executed bycontrol circuitry 304 (FIG. 3) as instructed by a media guidanceapplication implemented on a user device (e.g., user equipment devices402, 404, and/or 406 (FIG. 4)) in order to automatically update languagemodels in a user profile. In addition, one or more steps of process 500may be incorporated into or combined with one or more steps of any otherprocess or embodiment (e.g., as described in relation to FIGS. 7-8)).

At step 502, the media guidance application receives, via an audio inputdevice, a first voice query from a user, where the user corresponds to auser profile that is stored in memory. For example, the media guidanceapplication (e.g., via control circuitry 304 (FIG. 3)) may receive auser dictated actor name, “George Clooney”.

At step 504, the media guidance application applies a first languagemodel, based on the user profile, to the first voice query to identify afirst plurality of search results based on the first voice query,wherein the first language model corresponds to the first language. Forexample, the media guidance application has a first language, such asEnglish, stored in memory (e.g., via control circuitry 304 (FIG. 3))that will be used to produce search results.

At step 506, the media guidance application generates for display (e.g.,via control circuitry 304 (FIG. 3)), on a display device, the firstplurality of search results. For example, the media guidance applicationwill show the user results on a certain display. These results will bein the first language model from the user's profile.

At step 508, the media guidance application retrieves, from the memory(e.g., via storage circuitry 308 (FIG. 3)), first metadata for the firstsearch result and second metadata for the second search result. Forexample, the media guidance application will access the informationrelated to the first search result and the second search result.

At step 510, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)) that the first metadata for the firstsearch result of the first plurality of search results indicates thatthe first search result corresponds to the first language. For example,the media guidance application may determine that the search results forthe first language are in fact English as indicated in the user profile.

At step 512, the media guidance application, in response to determining(e.g., via control circuitry 304 (FIG. 3)) that the second metadata forthe second search result of the first plurality of search resultsindicates that the second search result corresponds to the secondlanguage, updates the user profile to apply a second language model tosearches, where the second language model corresponds to the secondlanguage. For example, the media guidance application may determine thatthe second language, which could be Hindi, is reflected in the secondsearch result.

At step 514, the media guidance application receives (e.g., via controlcircuitry 304 (FIG. 3)) a second voice query from the user. For example,the media guidance application may receive and additional actor or moviein which the user is interested.

At step 516, the media guidance application applies (e.g., via controlcircuitry 304 (FIG. 3)) both the first language model and the secondlanguage model, based on the updated user profile, to the second voicequery to identify a second plurality of search results based on thesecond voice query. For example, if the profile has the first languagemodel set to English and the second language model to Spanish, the mediaguidance application will apply both language models to produce searchresults.

At step 518, the media guidance application generates for display (e.g.,via control circuitry 302 (FIG. 3)), on the display device, the secondplurality of search results. For example, the media guidance device willshow the second search results from the second voice query.

It is contemplated that the steps or descriptions of FIG. 5 may be usedwith any other embodiment of this disclosure. In addition, the steps anddescriptions described in relation to FIG. 5 may be done in alternativeorders or in parallel to further the purposes of this disclosure. Forexample, each of these steps may be performed in any order or inparallel or substantially simultaneously to reduce lag or increase thespeed of the system or method. Furthermore, it should be noted that anyof the devices or equipment discussed in relation to FIGS. 3-4 could beused to perform one or more of the steps in FIG. 5.

FIG. 6 is a flowchart of illustrative steps for automatically updatinglanguage models in a user profile. It should be noted that process 600or any step thereof could be performed on, or provided by, any of thedevices shown in FIGS. 3-4. For example, process 600 may be executed bycontrol circuitry 304 (FIG. 3) as instructed by a media guidanceapplication implemented on a user device (e.g., user equipment devices402, 404, and/or 406 (FIG. 4)) in order to automatically update languagemodels in a user profile. In addition, one or more steps of process 600may be incorporated into or combined with one or more steps of any otherprocess or embodiment (e.g., as described in relation to FIGS. 7-8)).

At step 602, the media guidance application receives (e.g., via controlcircuitry 302 (FIG. 3)), via an audio input device, a first voice queryfrom a user, wherein the user corresponds to a user profile that isstored in memory (e.g., via memory circuitry 308 (FIG. 3)), and whereinthe user profile indicates a first language model for a first languageand a second language model for a second language for determining searchresults. For example, the media guidance application may have a userprofile with English and Spanish stored as the first language and secondlanguage.

At step 604, the media guidance application identifies a first audiosegment and a second audio segment based on audible breaks in the firstvoice query (e.g., via memory circuitry 302 (FIG. 3)). For example, ifthe first voice query is “Bob Saget”, then the first audio segment is“Bob” and the second audio segment is “Saget”.

At step 606, the media guidance application determines, for a first textquery, a first resolved word in a first language and a second resolvedword in the first language by combining a first set of phonemes to thefirst audio segment and second audio segment, respectively (e.g., viacontrol circuitry 304 (FIG. 3)), wherein the first set of phonemes areselected from phonemes of the first language model. For example, if thefirst language is English, then the media guidance application willapply that language model to “Bob” and “Saget” and provide resolvedwords.

At step 608, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)) a first numerical ranking for the firstresolved word in the first language and a second numerical ranking forthe second resolved word in the first language based on a likelihood ofusage of the first resolved word in the first language and the secondresolved word in the first language (e.g., via control circuitry 304(FIG. 3)). For example, while English is still the language model, themedia guidance application will rank the resulting resolved words basedon the likelihood of occurrence.

At step 610, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)) a first composite score for the firsttext query based on summing the first numerical ranking and the secondnumerical ranking (via control circuitry 304 (FIG. 3)). For example, thefirst and second resolved words' scores will be combined for a totalcomposite score.

At step 612, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)), for a second text query, a firstresolved word in a second language and a second resolved word in thesecond language by combining a second set of phonemes to the first audiosegment and second audio segment, respectively, wherein the second setof phonemes are selected from phonemes of the second language model. Forexample, if the second stored language is Spanish, the media guidanceapplication may apply the second language model to determine first andsecond resolved words.

At step 614, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)) a third numerical ranking for the firstresolved word in the second language and a fourth numerical ranking forthe second resolved word in the second language, based on a likelihoodof usage of the first resolved word in the second language and thesecond resolved word in the second language. For example, the mediaguidance application may use the second language model to produceresolved words and those resolved words in the second language will thenbe ranked.

At step 616, the media guidance application determines (e.g., viacontrol circuitry 302 (FIG. 4)) a second composite score for a secondtext query based on summing the third numerical ranking and the fourthnumerical ranking. For example, the media guidance application maycombine the third and fourth numerical rankings to gather a compositescore.

At step 618, the media guidance application generates for display (e.g.,via control circuitry 312 (FIG. 3)), on the display device, a firstsearch result for the first text query and a second search result forthe second text query, wherein the first search result and the secondsearch result are ordered according to the first composite ranking andthe second composite ranking, respectively. For example, the mediaguidance application will display for the user the text output of thetext queries.

It is contemplated that the steps or descriptions of FIG. 6 may be usedwith any other embodiment of this disclosure. In addition, the steps anddescriptions described in relation to FIG. 6 may be done in alternativeorders or in parallel to further the purposes of this disclosure. Forexample, each of these steps may be performed in any order or inparallel or substantially simultaneously to reduce lag or increase thespeed of the system or method. Furthermore, it should be noted that anyof the devices or equipment discussed in relation to FIGS. 3-4 could beused to perform one or more of the steps in FIG. 6.

FIG. 7 is a flowchart of illustrative steps for whether to update a userprofile or not. It should be noted that process 700 or any step thereofcould be performed on, or provided by, any of the devices shown in FIGS.3-4. For example, process 700 may be executed by control circuitry 304(FIG. 3) as instructed by a media guidance application implemented on auser device (e.g., user equipment devices 402, 404, and/or 406 (FIG. 4))in order to update a user profile or not. In addition, one or more stepsof process 700 may be incorporated into or combined with one or moresteps of any other process or embodiment (e.g., as described in relationto FIGS. 7-8)).

At step 702, the media guidance application determines (e.g., viacontrol circuitry 304 (FIG. 3)) a language corresponding to a mediaasset. For example, the media guidance application may determine to setthe language to English either from user input or automatically.

At step 704, the media guidance application retrieves (e.g., via controlcircuitry 302 (FIG. 3)) a language setting from user profile (e.g., asstored in storage 308 (FIG. 3) and/or any location accessible viacommunications network 414 (FIG. 4)). For example, the media guidanceapplication may retrieve a language setting of English.

At step 706, the media guidance application assesses (e.g., via controlcircuitry 304 (FIG. 3)) whether the language corresponds to the currentlanguage setting. If the language is the same, the media guidanceapplication will jump to step 710. If the language is different, themedia guidance application will proceed to step 708. For example, ifEnglish is already a stored language in memory, the application will notupdate the user profile.

At step 708, the media guidance application assesses (e.g., via controlcircuitry 304 (FIG. 3)) if a threshold number is met. If the thresholdis met, the media guidance application will proceed to step 712.However, if the threshold is not met, the application will proceed tostep 710. For example, if the English language model does not satisfythe threshold, the application will not update the user profile.

At step 710, the media guidance application receives input (e.g., viacontrol circuitry 302 (FIG. 3)) from previous steps and does not updatethe user profile. For example, if the language chosen is a duplicate,then the application will not update the user profile.

At step 712, the media guidance application receives input (e.g., viacontrol circuitry 302 (FIG. 3)) and updates the user profile. For,example, if the language chosen is unique and meets the threshold, theapplication will update the user profile.

It is contemplated that the steps or descriptions of FIG. 7 may be usedwith any other embodiment of this disclosure. In addition, the steps anddescriptions described in relation to FIG. 7 may be done in alternativeorders or in parallel to further the purposes of this disclosure. Forexample, each of these steps may be performed in any order or inparallel or substantially simultaneously to reduce lag or increase thespeed of the system or method. Furthermore, it should be noted that anyof the devices or equipment discussed in relation to FIGS. 3-4 could beused to perform one or more of the steps in FIG. 7.

FIG. 8 is a flowchart of illustrative steps for whether to update a userprofile or not. It should be noted that process 700 or any step thereofcould be performed on, or provided by, any of the devices shown in FIGS.3-4. For example, process 800 may be executed by control circuitry 304(FIG. 3) as instructed by a media guidance application implemented on auser device (e.g., user equipment devices 402, 404, and/or 406 (FIG. 4))in order to update a user profile or not. In addition, one or more stepsof process 800 may be incorporated into or combined with one or moresteps of any other process or embodiment (e.g., as described in relationto FIGS. 7-8)).

At step 802, the media guidance application automatically updates (e.g.,via control circuitry 304 (FIG. 3)), user profile with search resultscorresponding to the second language model. For example, the applicationmay change the second language to French.

At step 804, the media guidance application assesses (e.g., via controlcircuitry 314 (FIG. 3)) whether the user confirmed the second languagemodel. If the user did confirm the second language the media applicationwill proceed to step 806; however, if the user did not confirm, theapplication will proceed to step 802. For example, the user may notconfirm the second language if it was updated automatically to alanguage not desired by the user.

At step 806, the media guidance application assesses (e.g., via controlcircuitry 304 (FIG. 3)) whether the second language is the same as thefirst. For example, if the second language is updated to French, butFrench is already stored as the first language, the application willproceed to step 808. If the language is something other than French, theapplication will proceed to step 810.

At step 808, the media guidance application may not update (e.g., viacontrol circuitry 302 (FIG. 3)) the user profile. For example, if theupdated language is the same as the first, then the application may notupdate the user profile (e.g., via memory circuitry 308 (FIG. 3)).

At step 810, the media guidance application updates the user profile.For example, if the second language is different from the first, theapplication will update the user profile (via memory circuitry 308 (FIG.3)).

It is contemplated that the steps or descriptions of FIG. 8 may be usedwith any other embodiment of this disclosure. In addition, the steps anddescriptions described in relation to FIG. 8 may be done in alternativeorders or in parallel to further the purposes of this disclosure. Forexample, each of these steps may be performed in any order or inparallel or substantially simultaneously to reduce lag or increase thespeed of the system or method. Furthermore, it should be noted that anyof the devices or equipment discussed in relation to FIGS. 3-4 could beused to perform one or more of the steps in FIG. 7.

1.-51. (canceled)
 52. A method for generating results in multiplelanguages for search queries, comprising: receiving a first voice query;accessing a first language model and a second language model;identifying a first audio segment and a second audio segment based onaudible breaks in the first voice query; determining a first resolvedword in a first language and a second resolved word in the firstlanguage by combining a first set of phonemes to the first audio segmentand second audio segment, respectively, the first set of phonemesselected from phonemes of the first language model; generating a firsttext query including the first resolved word in the first language andthe second resolved word in the first language; determining a firstnumerical ranking for the first resolved word in the first language anda second numerical ranking for the second resolved word in the firstlanguage based on a likelihood of usage of the first resolved word inthe first language and the second resolved word in the first language;determining a first composite score for the first text query based onsumming the first numerical ranking and the second numerical ranking;determining a first resolved word in a second language and a secondresolved word in the second language by combining a second set ofphonemes to the first audio segment and second audio segment,respectively, the second set of phonemes selected from phonemes of thesecond language model; generating a second text query including thefirst resolved word in the second language and the second resolved wordin the second language; determining a third numerical ranking for thefirst resolved word in the second language and a fourth numericalranking for the second resolved word in the second language based on alikelihood of usage of the first resolved word in the second languageand the second resolved word in the second language; determining asecond composite score for the second text query based on summing thethird numerical ranking and the fourth numerical ranking; generating afirst search result for the first text query and a second search resultfor the second text query; ranking the first search result and thesecond search result according to the corresponding first compositeranking and the second composite ranking; and providing the first searchresult and second search result according to the determined order. 53.The method of claim 52, further comprising: determining that metadatafor a third search result indicates that the third search resultcorresponds to a third language; and in response to determining thatmetadata for a third search result indicates that the third searchresult corresponds to the third language, generating a third languagemodel for the third language for determining search results.
 54. Themethod of claim 53, wherein the third language model is generated afterreceiving confirmation to apply the third language model for determiningsearch results, and wherein a prompt for receiving the confirmation isgenerated in response to determining that the metadata for the thirdsearch result indicates that the third search result corresponds to thethird language.
 55. The method of claim 54, wherein generating the thirdlanguage model further comprises: accessing a language setting, thelanguage setting including a setting language; comparing the settinglanguage to the third language; and determining that the third languagedoes not correspond to the setting language.
 56. The method of claim 55,further comprising: receiving a second voice query; identifying a thirdaudio segment and a fourth audio segment based on audible breaks in thesecond voice query; determining a first resolved word in the thirdlanguage and a second resolved word in the third language by combining athird set of phonemes to the third audio segment and fourth audiosegment, respectively, the third set of phonemes selected from phonemesof the third language model; and generating a third text query includingthe first resolved word in the third language and the second resolvedword in the third language.
 57. The method of claim 52, furthercomprising: determining a plurality of potential text queries, using thefirst language model, by applying different combinations of the firstset of phonemes to the first audio segment and second audio segment,respectively; ranking the potential text queries according to theirrespective composite scores; and identifying a highest ranked text queryfrom the potential text queries based on the first composite scorehaving a highest ranking of the respective composite scores.
 58. Themethod of claim 57, further comprising generating a prompt to select thehighest ranked text query if multiple text queries of the potential textqueries share the highest ranking of the respective composite scores.59. The method of claim 58, further comprising: accessing a compositescore threshold; and providing the highest ranked search result for thefirst text query in response to determining that a correspondingcomposite score is above the composite score threshold score.
 60. Themethod of claim 59, further comprising: accessing first metadata for thehighest ranked search result; automatically determining that the firstmetadata for the highest ranked search result indicates that the highestranked search result does not correspond to the first language; and notprovide the highest ranked search result for the first text query inresponse to determining that the first metadata for the highest rankedsearch result indicates that the highest ranked search result does notcorrespond to a first language.
 61. The method of claim 52, furthercomprising: determining, for a third text query, the first resolved wordin the first language and the second resolved word in the secondlanguage by combining the first set of phonemes to the first audiosegment and the second set of phonemes to the second audio segment;determining a fifth numerical ranking for the first resolved word in thefirst language and a sixth numerical ranking for the second resolvedword in the second language based on a likelihood of usage of the firstresolved word in the first language and the second resolved word in thesecond language; and determining a third composite score for the thirdtext query based on summing the fifth numerical ranking and the sixthnumerical ranking.
 62. A system for generating results in multiplelanguages for search queries, comprising: memory configured to store aplurality of language models; first input/output circuitry configuredto: receive a first voice query; access a first language model and asecond language model; and processing circuitry configured to: identifya first audio segment and a second audio segment based on audible breaksin the first voice query; determine a first resolved word in a firstlanguage and a second resolved word in the first language by combining afirst set of phonemes to the first audio segment and second audiosegment, respectively, the first set of phonemes selected from phonemesof the first language model; generate a first text query including thefirst resolved word in the first language and the second resolved wordin the first language; determine a first numerical ranking for the firstresolved word in the first language and a second numerical ranking forthe second resolved word in the first language based on a likelihood ofusage of the first resolved word in the first language and the secondresolved word in the first language; determine a first composite scorefor the first text query based on summing the first numerical rankingand the second numerical ranking; determine a first resolved word in asecond language and a second resolved word in the second language bycombining a second set of phonemes to the first audio segment and secondaudio segment, respectively, the second set of phonemes selected fromphonemes of the second language model; generate a second text queryincluding the first resolved word in the second language and the secondresolved word in the second language; determine a third numericalranking for the first resolved word in the second language and a fourthnumerical ranking for the second resolved word in the second languagebased on a likelihood of usage of the first resolved word in the secondlanguage and the second resolved word in the second language; determinea second composite score for the second text query based on summing thethird numerical ranking and the fourth numerical ranking; generate afirst search result for the first text query and a second search resultfor the second text query; rank the first search result and the secondsearch result according to the corresponding first composite ranking andthe second composite ranking; and second input/output circuitryconfigured to provide the first search result and second search resultaccording to the determined order.
 63. The system of claim 62, whereinthe processing circuitry is further configured to: determine thatmetadata for a third search result indicates that the third searchresult corresponds to a third language; and in response to determiningthat metadata for a third search result indicates that the third searchresult corresponds to the third language, generate a third languagemodel for the third language for determining search results.
 64. Thesystem of claim 63, wherein the third language model is generated afterreceiving confirmation to apply the third language model for determiningsearch results, and wherein a prompt for receiving the confirmation isgenerated in response to determining that the metadata for the thirdsearch result indicates that the third search result corresponds to thethird language.
 65. The system of claim 64, wherein the processingcircuitry is further configured to: access a language setting, thelanguage setting including a setting language; compare the settinglanguage to the third language; and determine that the third languagedoes not correspond to the setting language.
 66. The system of claim 65,wherein the first input/output circuitry is further configured to:receive a second voice query; and processing circuitry is furtherconfigured to: identify a third audio segment and a fourth audio segmentbased on audible breaks in the second voice query; determine a firstresolved word in the third language and a second resolved word in thethird language by combining a third set of phonemes to the third audiosegment and fourth audio segment, respectively, the third set ofphonemes selected from phonemes of the third language model; andgenerating a third text query including the first resolved word in thethird language and the second resolved word in the third language. 67.The system of claim 62, wherein the processing circuitry is furtherconfigured to: determine a plurality of potential text queries, usingthe first language model, by applying different combinations of thefirst set of phonemes to the first audio segment and second audiosegment, respectively; rank the potential text queries according totheir respective composite scores; and identify a highest ranked textquery from the potential text queries based on the first composite scorehaving a highest ranking of the respective composite scores.
 68. Thesystem of claim 67, wherein the processing circuitry is furtherconfigured to generating a prompt to select the highest ranked textquery if multiple text queries of the potential text queries share thehighest ranking of the respective composite scores.
 69. The system ofclaim 68, wherein the processing circuitry is further configured toaccess a composite score threshold; and the second input/outputcircuitry is further configured to provide the highest ranked searchresult for the first text query in response to determining that acorresponding composite score is above the composite score thresholdscore.
 70. The system of claim 69, wherein the processing circuitry isfurther configured to: access first metadata for the highest rankedsearch result; automatically determine that the first metadata for thehighest ranked search result indicates that the highest ranked searchresult does not correspond to the first language; and the secondinput/output circuitry is further configured to not provide the highestranked search result for the first text query in response to determiningthat the first metadata for the highest ranked search result indicatesthat the highest ranked search result does not correspond to a firstlanguage.
 71. The system of claim 62, wherein the processing circuitryis further configured to: determine, for a third text query, the firstresolved word in the first language and the second resolved word in thesecond language by combining the first set of phonemes to the firstaudio segment and the second set of phonemes to the second audiosegment; determine a fifth numerical ranking for the first resolved wordin the first language and a sixth numerical ranking for the secondresolved word in the second language based on a likelihood of usage ofthe first resolved word in the first language and the second resolvedword in the second language; and determine a third composite score forthe third text query based on summing the fifth numerical ranking andthe sixth numerical ranking.